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Abstract. We present the variability processing and analysis that is 
foreseen for the Gaia mission within Coordination Unit 7 (CU7) of the 
Gaia Data Processing and Analysis Consortium (DPAC). A top level 
description of the tasks is given. 


1 Introduction 


Variable phenomena are common in the Univ erse and m ulti-epoch su rveys are pro¬ 
ducing very interesting scientific results (see lEver fe Mowlavil 2008). Among all 
the numerous existing surveys, Gaia is really prominent and will h ave a t remen- 
dous impact on Time Domain Astronomy (see for example lEver et all 120121 
2013h . Within the G aia Data Processing and Analysis Consortium (DPAC, see 


Mignard et all 12008 ). one Coordination Unit (CU7) is dedicated to the themat- 
ics of variability. The objective of CU7 is to populate the Gaia catalogue with 
properties of sources having photometric and/or spectral variability. 

CU7 is composed of about 70 researchers, postdocs, students, and computers 
scientists who are distributed among about 20 countries. 
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Fig. 1 . Schematic Gaia Variability Processing and Analysis Work Breakdown. 


2 Variability analysis Work Breakdown 


The top-level work-breakdown is represented in Fig. [lj We describe briefly the 
different tasks hereafter. 


Variability Detection: Variability detection comprises both statistical tests 
for the general detection of variability as well as specific algorithms that leverage 
knowledge of a specific variability behaviour to achieve a higher detection effi¬ 
ciency. These latter algorithms ensure variability detection in cases where general 
variability tests are not sufficient and in addition derive some parameters specific 
to the type of varia bility. The cases treated by such specific algorithms are: short 
time-scal es (IVaradi et ah . 2011), small amplitude perio dic variables, exo-planet ary 
transits (iTindeyl . 2011 : lDzig:an fc Zuckerl [2Q12L 120131) . solar (magnetic) variables 


(D istefano et ah . 20121) . 


Characterization: The variability behaviour is characterized using a classical 
Fourier decomposition approach; the goal being to find the simplest possible de¬ 
scription of the observed variability. Each source is scanned for periodicity and the 
most significant period, if significant enough, is retained and a Fourier series model 
fit. If the model residuals still show variability, according to the criteria defined 
above, an additional period search will be initiated to improve the final model. If 
no significant period is found, the final model will be a polynomial only and the 
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source is designated as non-periodically variable. The parameters of the model and 
all computed statistical parameters are used to construct the attributes for Classi¬ 
fication. A lot of attention was given to optimizing the period search computation 
to find the best compromise between speed and efficiency. Significant theoretical 
research went into the search for the most appropriate false-alarm probability for 
periodicity for these data sets. 

Classification: Once variables are identified and characterised, they go through 
different classification methods: Supervised, clustering methods and extractors 
(microlensing, transients, massive companions in binary systems). The result is a 
number of groups containing objects with similar variability behaviour (and most 
probably similar physical properties). Methods for extr actors and s upervised algo- 
rithms have been tested on Hipparcos data lDubath et all ( 2011 b iRimoldini et al. 

(I2012I) . 


Specific Object Studies: Once the variability type of a given source is 
assigned by Classification, specific algorithms are applied to validate it and to 
compute additional attributes. Some examples are: the Paczynski parameters of 
microlensing events; a sub-classification of eclipsing binaries; the computation of 
flare occurrence and magnitude for flaring stars, of rotation period and light-curve 
amplitude for rotation-induced variables, or of burst and outburst parameters 
for transients such as Be stars; the characterisation of pulsating stars such as 
long period variables, Cepheids, RR Lyrae, or of very short period variables like 
AM CVn or ZZ Ceti stars. 

Global Variability Studies: Subsequent to the source-by-source process¬ 
ing, all possible information about Gaia variables is available in the Variability 
Database. Several catalogue-level results such as the period distribution for all 
Cepheids, the colour-magnitude or HR diagram with iso-contour of variability 
amplitude, etc. are derived. The statistical properties of the Gaia Variability 
Database are studied compared to those of similar surveys in order to assess the 
quality of the Variability Database and to identify possible errors or problems 
in the data processing. Tools to aggregate, evaluate and visualize the database 
content have been developed. 

Supplementary Observations: Finally, there are cases where supplemen¬ 
tary observations, either from ground-based observatories, or from other satellites, 
will be required to complete our knowledge about the Gaia sources. The use of 
these observations is to help the development of the data processing or to test 
the output of the Gaia analysis. A network of small medium-sized telescopes was 
put in place and supplementary observations (photometry and/or spectroscopy) 
were collected for samples of Cepheids, RR Lyrae stars, Miras, short-period vari¬ 
ables and Be stars. The network of telescope is also actively participating in the 
validation of the Gaia Alerts stream. 


3 Geneva Data Processing Centre 

The software modules delivered by CU7 are described in Section 2, the releases of 
these modules are integrated at DPCG, the Geneva Data Processing Centre. The 
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software once integrated receives the name of VariPipe. During every processing 
cycle in Gaia a new release of VariPipe is produced and deployed in the process¬ 
ing clu ster in Gene va to obtain the scientific results out of the Gaia data (see 
iNienartowicz et ah . 1201 4). The DPCG is also the interface with the other data 
processing centres of the consortium and the Science Operations Centre. The 
data produced by the upstream CUs is received and prepared for its analysis, the 
software chains are run on the data and the produced results are prepared to be 
integrated in the Gaia catalogue. DPCG provides tools to analyze the input data 
as well as the results to ensure the quality of the delivered products to the Gaia 
catalogue. 


4 Releases 

A first global release of all variable sources detected among the one billion ob¬ 
served sources is foreseen in the 4th release (currently planned for 2018/2019), 
the final release might come in 2022. However the goal of CU7 is to release at 
earlier dates specific groups of variables, once the estimations of completeness and 
contamination are thought to be reliable enough and reach acceptable levels. 
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